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Background / Context: 

School leadership and a eulture of trust are widely reeognized as important in promoting in- 
sehool proeesses and eonditions that support and inerease student learning and aehievement 
(Louis et ah, 2010; Robinson et ah, 2008; Supovitz et ah, 2009, Bryk et al, 2010). A meta- 
analysis of 70 studies found effect sizes ranging from 0.16 to 0.33 between teacher perception of 
sehool leadership and student aehievement (Waters et ah, 2003). Robin s on et al. (2008) analyzed 
22 studies and eoneluded the average effeet of instruetional leadership is three to four times 
greater (effeet size of 0.44) than the effect of transformational leadership on student outeomes. 
Reeent empirieal work has supported the eonneetions between sehool leadership, teacher 
praetiees, and improved student outeomes (e.g.. Day, et al. 2009;, Heek and Hallinger 2010; 
Goddard, et ah, 2010; May and Supovitz, 2011; Sammons, et ah, 2010; Supovitz, Sirinides, and 
May, 2010). These studies show eonsistently positive assoeiations between leadership and 
student outcomes aeross a variety of organizational (e.g., elementary, middle, and high sehools; 
publie, private, and eharter), regional (e.g., urban/suburban), and, temporal eontexts (1980 
through present). 

Although the strengths portrayed by these studies are eonsistent, so too are the weaknesses. The 
extant published empirieal work on leadership effeets is exelusively (a) eorrelational, (b) 
primarily eross-sectional, and (e) non-interventional. Positive findings from eorrelational studies 
provide a persuasive promise, while the lack of rigorous causal leadership studies provides a 
eompelling need for rigorous evaluations. This study examined a feedbaek in a randomized 
design and a eoaehing intervention eombined with feedbaek in a non-randomized design. 

Purpose / Objective / Research Question / Focus of StudyrThe purpose of this study is to evaluate 
the effieaey of a feedbaek and eoaehing intervention designed to improve the quality of prineipal 
leadership. Prineipals reeeived feedbaek from teaehers on their instruetional leadership, and their 
teachers’ trust of them. Prineipals also provided self-ratings and they eompared their teachers' 
ratings to their own self -ratings. In the first part of the study they received feedbaek reports of 
these data. In the seeond part they worked with trained eoaehes on how to use and integrate that 
information into their edueational praetiee. We predieted that this intervention would improve 
principals’ leadership, trust, and, ultimately, enhance student achievement. (Student achievement 
was not evaluated in this project). The first hypothesis about feedbaek was tested in a 
longitudinal randomized design and the seeond eoneerning the addition of eoaehing in a pre-post 
design where prineipals reeeived feedbaek only in the first year and feedbaek and eoaehing in the 
seeond year. 

Feedbaek for Improving Leadership. Multi-souree feedbaek typieally entails a leader’s self- 
assessment as well as assessments from subordinates, peers, and/or superiors. This type of 
feedbaek provides information about leadership efficacy that resides within the shared 
experienees of these individuals rather than from any single souree (Atwater, 1998). Researeh in 
the private seetor supports the use of subordinate feedbaek to faeilitate communieation (Tornow, 
London & CCL Assoeiates, 1998), provide unique perspectives (Chureh, 1997), and serve as a 
reliable source of useful information (Smither et ah, 2005). Numerous studies, outside of 
edueation, have reported improved leadership and inereased leader self-awareness (e.g., Hesketh 
et ah, 2005; Reilly et al, 1996) and overall improved performanee (Kluger & DeNisi, 1996; 
Walker & Smither, 1999) following feedbaek from subordinates. 
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Coaching for Improving Leadership . Feedback alone, without supports for implementing the 
feedbaek, may fail to improve leadership for several reasons. First, individuals are often self- 
enhaneing and self-proteetive (Alieke & Sedikides, 2009) and tend to “aeeept feedbaek from 
others that is eonsistent with the way we see ourselves and to reject feedback that is inconsistent 
with the way we see ourselves” (Goldsmith, 2004, p.7). By denying the validity of the feedback, 
people relieve themselves of the responsibility to respond to it. Seeond, persons often struggle 
with interpreting feedbaek and drawing valid eonelusions (Cannon & Witherspoon, 2005). Our 
theoretical model indieated that if the prineipal did not aeeept the validity of the measures or 
feedbaek, then it would be unlikely that positive ehange would oeeur. 

The combination of feedback and coaching has become one of the fastest growing exeeutive 
development strategies in global companies (Ameriean Management Assoeiation, 2008; Luthans 
& Peterson, 2004). Business organizations frequently hire eoaehes to help managers work 
through the natural barriers to using feedbaek produetively, assisting in interpreting and 
internalizing the feedbaek as well as setting speeifie goals for change (MeDowall, 2008; Jarvis & 
Fillery-Travis, 2006). Existing researeh in business suggests that eoaching has positive results 
(Kombarakaran et ah, 2008; Bowles et al., 2007). For example, Thateh (2002) found that the 
eombination of feedbaek and eoaehing inereased leadership effeetiveness up to 60 pereent. 

These positive results suggest the value of testing a proeess of feedbaek and eoaehing in 
educational settings designed to improve prineipal leadership. Coaehing for sehool prineipals is 
beeoming a more popular approach to leadership development 

Setting: 

The experiment was implemented in a large urban sehool distriet in the Southeastern United 
States. All principals serving in the 108 regular, elementary or middle schools the sehool system 
at the start of the study were eligible to partieipate. High sehools and sehools for seleet student 
populations (e.g., alternative sehools, speeial edueation) were exeluded beeause of the unique 
nature of leadership in these eontexts. 

Population / Participants / Subjects: 

Of the 108 sehools that were eligible to participate, 76 eonsented to partieipate and were 
randomly assigned to the treatment (feedback) or control (no feedback) group. Overall, 
prineipals had been leaders at their current school on average 2.4 years. They had been principals 
in the same sehool distriet on average 5.6 years. Teaehers had been teaehing in their eurrent 
sehool for an average 1.6 years and with their prineipals for an average 1.1 years. The average 
pereentage of students in the free/redueed lunch program was 68%. Overall, 88% and 85% of 
students were profieient or advaneed on their standardized tests in reading and math, respectively 
Intervention / Program / Practice: 



The first phase of the researeh was feedbaek only. Beginning in the spring of 2008, all treatment 
and eontrol teaehers responded to a detailed survey regarding the prineipals’ learning-eentered 
leadership, and the levels of trust between prineipals and teaehers. Two to three weeks after the 
survey was administered, treatment prineipals reeeived a detailed feedback report, while the 
eontrol group received no feedback. This procedure of survey and feedback reports for the 
treatment group and survey only for the eontrol was repeated a total of 4 times in the first phase 
of the study: spring 08, fall 09, winter 09 and spring 09. Feedbaek was presented in reports that 
ineluded a narrative as well as graphs of the results of teaeher and prineipal survey responses for 
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each wave of data. Reports presented data for the results of each wave (time), as well as 
longitudinally based on the waves of accumulated data from prior survey administrations. 

Finally, data on all schools in the sample were averaged so that principals could compare their 
results to those of other participating schools (not just schools in the feedback group). 

In the second phase of the research coaching was implemented to help principals understand and 
use their feedback data. Our coaching model prescribes a progression of foci and activities to be 
implemented over five phases that unfold across up to 15 sessions in a year. Coaches scheduled 
sessions with their principals, typically in the principal’s office, with automated meeting 
reminders emailed to coaches and principals. Meetings lasted between 45-90 minutes, with 
longer sessions generally immediately following a feedback report and shorter sessions in 
between feedback reports. Rather than following a specific set of procedures or protocols for 
each session, the focus in our model is on developing principals' knowledge and skills in the 
service of developing instructional leadership behaviors and achieving outcomes. This is 
achieved by building a developmental relationship, listening, questioning, assessing, feedback 
giving, confronting, motivating, goal setting, action planning, establishing accountability, and 
supporting. The specific phases of the coaching model are: 1) Groundwork involves setting the 
stage for an effective coaching engagement; (2) Assessment and feedback help principals get 
clear pictures of themselves as educational leaders. Coaches ask questions to assess what 
principals learn from their feedback and explore its meaning and implications;. (3) Goal setting 
is the process of moving from an informed perspective to having a specific, measurable 
objective. The responsibility of the coach is to assist the principal in selecting a meaningful 
target for change and designing and committing to a goal or set of goals; (4) Action planning 
involves designing clear, concrete, and specific steps that, if followed, will lead to goal 
attainment; and (5) Ongoing assessment and support involve measuring progress over time, 
addressing challenges that emerge, providing support to build motivation, and keeping the 
principal on track. 

Research Design: 

Study I, with feedback only, was a randomized cluster design with randomization occurring at 
the school level. Study II, which was feedback plus coaching was implemented, used a quasi- 
experimental pre-post design. . 

Data Collection and Analysis: 

Feedback Alone . We tested the effect of feedback to principals, without coaching, on teacher 
perceptions of instructional leadership and their trust in their principal in 52 schools. Principals 
were randomly assigned to one of two groups; 26 principals received feedback reports based on 
surveys completed by their teachers during the year. 26 principals did not receive feedback 
(control group). About 38 teachers on average per school provided feedback data. Data from 
teachers and principals were collected in four waves: spring of 2008 and in the fall, winter and 
spring of 2008-2009. Principals in the feedback group received four feedback reports, one 
feedback report per wave. We used hierarchical linear growth models to predict growth on the 
two teachers' reports of the principal's instructional leadership and their trust of their principal. 
Teacher measures were summarized per principal for each wave (N= 208: 52 
principals*4waves). 

Feedback Plus Coaching. In the second part of the study, we examined the effects of feedback 
and coaching with 39 schools (a subset of the 52 schools in the feedback only analysis). All 39 
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principals received feedbaek only in their first three waves and feedback plus coaching in their 
following three waves, thus all prineipals reeeived feedbaek in eaeh of 6 waves. Coaehing was 
provided during the last two waves (fourth through sixth), whieh meant that the last two teaeher 
data oolleetion points were eollected after eoaehing began. On average, principals received 7 
coaching sessions for approximately an hour eaeh session (SD=2.88, range 2-15). 

We used longitudinal HLM growth models to estimate the effeet of eoaehing on the two 
outcomes, instructional leadership and teaehers’ trust of their prineipal. In this analysis we also 
used the prineipal’s pereeption of the validity of the measures/feedbaek. Our theoretieal model 
indieated that if the prineipal did not aeeept the validity of the measures or feedbaek, then it 
would be unlikely that positive ehange would oecur. The three-way interaetion measured the 
additional outeome each coaching session might produee in the last wave at different levels of 
validity. 

Findings / Results: 

Feedbaek Alone . The results of the HLM model for instruetional leadership, indieated a 
statistieally signifieant time by feedbaek interaetion effeet (p<.01) with a feedbaek effeet size of 
0.14. Providing feedbaek reports produeed a small but positive effeet size on prineipals' 
instruetional leadership. Similar results were found for relational trust in the prineipal from 
teaehers (ES=0.16). Please see Table 1 in Appendix B. 

Feedback plus Coaehing. There was no signifieant main effect for coaching (Coaching* Waved 
p-value=0.59) or signifieant two-way interaetion effeets. However, there was a signifieant three- 
way interaetion (p < .0001), suggesting that instruetional leadership ratings were higher for 
principals receiving more eoaehing sessions and whose pereeived validity of measures/feedback 
was above the mean (ES =0.34, Hedge’s g). Teaeher ratings of trust in the principal show similar 
results but with smaller ES (0.23, Hedge’s g), suggesting that trust is more diffieult to improve. 
Please see table 2 in Appendix B. 

Beeause the effeets of eoaehing and feedbaek depended on the number of eoaehing sessions 
delivered and the prineipal’s pereeived validity of measures/feedbaek (mean=3.39, SD=0.49), we 
modeled data showing the ehange of outcome and effeet sizes assoeiated with different levels of 
validity and number of coaehing sessions. When principals perceive measures/feedbaek to be 
less valid, more eoaehing sessions do not improve instruetional leadership; in faet they have a 
negative effeet. However, when the measures/feedbaek are pereeived as more valid, we obtain 
large positive effect sizes assoeiated with coaching and feedback. Eor example, in our modell5 
sessions ean ereate an effeet size ranging from -1.29 to 1.62, depending on the prineipal’s 
pereeption of validity. We also found similar results with teaehers’ prineipal trust seores. The ES 
for the average validity rating and 15 sessions is 0.88 for instruetional leadership and 0.41 for 
trust. Please see table 3 in Appendix B. 

Conclusions: 

The results indieate that teaehers’ pereeptions of instruetional leadership and trust of their 
principal can be enhanced when prineipals are provided with feedback alone. However, when 
eoaehing is added the effeets are mueh more powerful and depend on the principal’s perception 
of the validity of the feedbaek and the number of eoaehing sessions attended. This study 
provides new experimental evidenee of the effieaey of feedback and coaching as a program for 
leadership ehange and development aimed at improving instruetional leadership and teaeher- 
prineipal trust. 
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Appendix B, Tables and Figures 



Table 1. Instructional Leadership HLM Results for Feedback Only 



Fixed Effect 


Estimate 


Std. Error 


p-value 


Intercept 


3.65 


0.08 


<.0001 


Spring 


-0.18 


0.03 


<.0001 


Time (wave) 


0.01 


0.02 


0.52 


Intervention (Feedback=l) 


0.08 


0.12 


0.51 


T ime * Intervention 


0.06 


0.02 


0.01 


Variance Component^ 


Between-principals 


0.18 


0.04 


<.0001 


Within-principals 


0.04 


0.01 


<.0001 


Fit Statistics ML 








-2 Res Log Likelihood 


82.4 






Sample Size 


208 







Note: Fifty-two schools were randomly assigned into two equal groups: feedback and no feedback. We tested a 
model with two binary variables to measure separate changes in springOS and spring09 but its fit was poorer 
than using one spring variable. 

“Time was allowed to vary randomly across principals but was not significant, thus was dropped from the model. 



Table 2. Instructional Leadership & Trust in Principal HLM Results for Feedback and Coaching* 





Instructional Leadership Teachers’ Trust in Principal 


Fixed Effect 


Estimate 


Std. Error 


p-value 


Estimate 


Std. Error 


p-value 


Intercept 


3.87 


0.06 


<.0001 


3.07 


0.14 


<.0001 


Time 


-0.02 


0.01 


0.02 


-0.02 


0.01 


0.04 


Wave 6 ( Spring Yr2) 


-0.06 


0.05 


0.59 


0.04 


0.10 


0.70 


Validity (centered) 


0.02 


0.04 


0.62 


-0.06 


0.08 


0.44 


Coaching Sessions 


0.03 


0.02 


0.10 


0.03 


0.02 


0.18 


Coaching* V alidity 


0.01 


0.01 


0.57 


0.01 


0.01 


0.46 


Wave6*Coaching 


-0.01 


0.01 


0.59 


-0.02 


0.01 


0.10 


Wave6*Coaching*Validity 


0.04 


0.01 


<.0001 


0.03 


0.01 


0.01 


Variance Component 


Between-principals 


0.10 


0.02 


<.0001 


0.09 


0.02 


<.0001 


Within-principals 


0.03 


0.004 


<.0001 


0.03 


0.003 


<.0001 


Fit Statistics ML 














-2 Res Log Likelihood 


9.9 






14.8 






Sample size* 


223 






223 







*Based on 39 schools receiving feedback in three waves (fall, winter, and spring) and feedback and coaching 
in the following three waves (fall, winter, and spring of next academic year). 
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Table 3, Predicted Effect Sizes of Instructional Leadership by Number of Sessions by 
Level of Perceived Validity of Measures/Feedback 



% of Principals with 
validity scores shown 


Principals’ Validity hy # Sessions 


Predicted Change in Wave 6 


Predicted ES 


13% 


Validity [2.5-2.75] 








7 Coaching Sessions 


-0.33 


-0.67 




1 1 Coaching Sessions 


-0.48 


-0.98 




1 5 Coaching Sessions 


-0.63 


-1.29 


39% 


Validity [3.0-3.25] 








7 Coaching Sessions 


-0.19 


-0.38 




1 1 Coaching Sessions 


-0.26 


-0.53 




1 5 Coaching Sessions 


-0.33 


-0.67 


24% 


Validity [3.5-3.75] 








7 Coaching Sessions 


0.23 


0.48 




1 1 Coaching Sessions 


0.40 


0.82 




1 5 Coaching Sessions 


0.57 


1.16 


24% 


Validity Score = 4.0 








7 Coaching Sessions 


0.34 


0.69 




1 1 Coaching Sessions 


0.57 


1.16 




1 5 Coaching Sessions 


0.80 


1.62 
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